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1 Introduction 

We propose a refinement of the usual Ehrenfeucht-Fraisse game. The new 
game will help us make finer distinctions than the traditional one. In partic- 
ular, it can be used to measure not only quantifier rank but also lengths of 
conjunctions and disjunctions needed for expressing a given property. Our 
game is similar to the game in [I] and in [5]. 

The most common measure of complexity of a first order sentence is 
its quantifier rank, and the method of Ehrenfeucht-Fraisse games can be 
successfully used to prove lower bound results for this measure of complexity. 
However, the number of non-equivalent first order formulas of quantifier rank 
n is an exponential tower of height n with a polynomial depending on the 
vocabulary on the top. Thus we are not very close to knowing the formula 
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if we merely know its quantifier rank. In this paper we measure complexity 
in terms of the number of symbols in the formula. The advantage of our 
measure is that there are only 2 P ^ non-equivalent formulas with n symbols, 
where P(x) is a polynomial depending on the vocabulary. 

Let us consider the question of the complexity of deciding whether a 
binary string 

sis 2 ... s n = 10110010.. .01 (1) 

has a certain pattern. One approach to this is circuit complexity. For exam- 
ple, it is proved in [3) that constant depth circuits cannot decide the parity 
of (0Q), i.e. the question whether ([[]) has an even number of ones. Another 
approach is computational complexity, where the parity of ([TJ is easily de- 
cided in linear time. In so called descriptive complexity the question is posed, 
what is the logical complexity of the simplest formula in a given logic that 
expresses the property of <^ in question, for example parity. Concerning this 
type of problems, it was proved in [2] that the question whether a set R of 
n + 1-sequences 

(ax, . . . ,a n+ i) (2) 

from a finite set {1, . . . ,m} has an even cardinality cannot be expressed in 
existential second order logic, where the second order variables range over 
< n-ary relations, that is, there is no sentence of such existential second 
order logic with the property that the structure ({1, . . . , m}, R) satisfies if 
and only if R has even cardinality. 

Let us return to ([T]). A natural setup of using logic to study the question 
of complexity of finding patterns in (JT|) is to use the propositional logic with 
propositional symbols 

pi,p 2 ,...,p n (3) 

with the meaning 

Pi is true if and only if = 1. (4) 

Propositional logic gives rise to a variety of complexity measures. We focus 
here on the simplest of them, namely the number of occurrences of proposi- 
tional symbols in a formula. Up to a constant factor this is the same as the 
number of symbols, that is, the length of the formula. 

The parity of ([T]) is naturally expressed by the propositional formula 

n 

\/{f\p?:AC{l,... } n} } \A\even} } (5) 
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where 

A = (Pi if*e^ 

which gives the upper bound |n2 n to the size of the smallest formula ex- 
pressing parity. However, there is a shorter formula if) defined as follows: 
if) = (f>i in , where for 1 < i < j < n and k = \_^-\ 

4>i,j = {4>i,k A 4>k+i,j) v A -xpk+l,j) 

and 

4>i,i = ->Pi- 

This gives (up to a constant factor) the upper bound n 2 for the shortest 
propositional formula expressing parity^. Krapchenko (see P, p 258]) has 
proved in 1971 that this is optimal. We prove below the same result using 
our refinement of the Ehrenfeucht-Fraisse game. 

For a slightly different complexity question, suppose we have n unary 
predicates: 

P 1 (ar),...,P n (x). (7) 

We ask what is the length of the shortest formula in first order logic with 
the unary predicates (JTj) that expresses the property that every Boolean 
combination of the predicates is non-empty. This property can be written as 
the existential sentence: 

n 

/\{3x/\Pf(x):AC{l,...,n}}, (8) 

i=i 

where, as above in P^(x) = Pi(x) if % G A, and Pf(x) = ->Pi(x) if % $ A. 

This gives the upper bound (n + l)2 n to the size of the smallest sentence 
expressing the given property (we define the size of a formula to be the total 
number of atomic subformulas and quantifiers occurring in it). In Section 13 
we show that this is optimal for existential sentences. If we give up existential 
sentences and allow a quantifier alternation, we have a smaller sentence <f> 
defined as follows: 

n 

Vxly /\(Pi(x) P i+1 (y)) A 

i=l 

1 More precisely, a straightforward induction shows that the size of ip is at most (n + 1) 2 , 
and if n is a power of 2, then the size of ip is exactly n 2 . 
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Vx3y(/\(P i (x) ^ Pi(y)) A (Px(x) <-> -Pi(y))), 

i=2 

where addition z + 1 is to be calculated modulo n. The size of is only 
8n + 4, so this is really optimal (up to a constant factor) because we cannot 
have such a <fr without mentioning each Pj at least once. 

In our final application, we consider the minimal size of a first-order 
sentence expressing that the length of a linear order is at least n. It is 
well known that this can be expressed by a sentence with quantifier rank 
[logn] + 1. Such a sentence 4> n can be obtained as follows: define first 
recursively formulas 9k(x,y) by letting 9%(x,y) := x < y and 9k(x,y) : = 
3z{9i(x,z) A9 m (z,y)), where I = [k/2\ + 1 and m = \k/2], for k > 2. Thus, 
9k{x,y) says that there are at least k elements in the interval [x,y]. Then n 
can be defined as 3x3y9 n (x, y). 

Clearly the size of n is linear in n. However, with a clever trick of 
recycling quantified variables it is possible to define a variant 4>' n of <fi n that 
has size cflogn], where c is a small constant. Moreover, <j)' n can be chosen to 
be in the 4- variable logic FO4 (see [I], Section 5). On the other hand, Grohe 
and Schweikardt prove in [I] that the minimal size of an equivalent sentence 
in FO3 is at least \\Jn. Their proof is based on a method which is related to 
the game we study in this paper, but it is not formulated in a game theoretic 
form. 



2 A Game for Propositional Logic 

We introduce now a game for measuring how long a propositional formula 
has to be to express a given property of binary strings 

s = st . . . s n , (9) 

where Sj G {0, 1}. We denote strings by s,r,t etc. Let n be the fixed length 
of the strings considered. We shall discuss nonempty sets S of such strings 
and call them string properties. For example, S could be the set of strings 
(Q where the cardinality of the set {i £ {1, . . . , n} : Sj = 1} is even. Another 
example is the set of strings (Q with more ones than zeros. In an extreme 
case S can be a singleton, e.g. the singleton constant one string {111 ... 1}. 

Propositional formulas are built up from propositional symbols ()3]) by 
means of -1, A and V. We use Greek letters <p,ip etc to denote propositional 
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formulas. The concept s |= of a string s satisfying a propositional formula 
is defined in the usual way: s \= Pi if Sj = 1; s |= -i0 if s ^ 0; s \= A -0 if 
s |= and s|=^;s|=0V'0ifs|=0ors|='0. 

Definition 1 Suppose cf) is a propositional formula and S a string property. 
We say that defines S if for all strings s 

s G S -<=>- s |= 0. 

We define the size w(0) of the propositional formula as follows: 

w(pi) = 1 

w(-i0) = W(0) 
W(0V'0) = w(0) + w("0) 
w(0A-0) = w(0)+w('0)- 

Note that w(0) is just the number of occurrences of propositional symbols in 
0. Up to a constant factor, this is the same as the number of symbols in 0. 

Note that while there are, up to logical equivalence, 2 2 " propositional 
formulas over the propositional symbols pi, . . . ,p n altogether, there are for 
each m, up to logical equivalence, only at most 2 m • (n + 2) 2m propositional 
formulas with w(0) < m. 

A natural question now is: 

Question: Given a string property S, what is the size of the 
smallest propositional formula that defines S. 

We shall define a game for the study of this question. In defining the game 
we follow mostly [T] and to a lesser degree [S]. This game resembles the usual 
Ehrenfeucht-Frai'sse game for the first order logic, but also the Ehrenfeucht- 
Frai'sse game for the independence friendly logic, presented in [5]. 

In the usual Ehrenfeucht-Frai'sse game we have two structures 21 and 03 
and the game is about their similarity. Player II maintains that the structures 
are very similar. Player I maintains that there is a difference. During the 
game player I picks elements from either one of the models and player II tries 
to find similar elements in the other model. 

In the new game we do not have just two models but instead two classes 
of models. Player II maintains that whatever features the models in one 
class have all in common, some member of the other class also has. The 
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game to be defined is able to detect very minute differences in models and is 
in that sense finer than the usual Ehrenfeucht game. In particular, this game 
can detect differences in the size of conjunctions and disjunctions needed for 
separating the classes of models. 

Definition 2 Let S and R be string properties and w a positive integer. 
The game FF W (S, R) has two players. The number w is called the rank of 
the game. In the beginning the position is (w,S,R). Suppose the position 
after m moves is (w m , S m , R m ). There are the following two possibilities for 
the continuation of the game: 

Left splitting move: Player I first chooses numbers u and v such that 1 < 
u,v < w and u + v = w. Then player I represents S m as a union 
CUD. Now the game continues from the position (u, C, R m ) or from 
the position (v, D, R m ), and player II can choose which. 

Right splitting move: Player I first chooses numbers u and v such that 
1 < u, v < w and u + v = w. Then Player I represents R m as a union 
C U D. Now the game continues from the position (u, S m , C) or from 
the position (v, S m , D), and player II can choose which. 

The game ends in a position (w m , S m , R rn ) and player I wins if there is a 
propositional symbol pi such that either s \= Pi for all s G S rn and r \/= for 
all r G R m , or s Pi for all s G S m and r \= pi for all r G R m . Player II 
wins the game if they reach a position (w m , S m , R m ) such that w m = 1 and 
player I does not win in this position. 

This is a game of perfect information and the concept of winning strategy 
is defined as usual. Since the rank w m decreases in each move, the game 
always ends in a finite number of moves. Hence the game is determined: 
exactly one of the players has a winning strategy in EF W (S, R). 

Note that if the players have reached a position (w m , S m , R m ) in the game 
EF W (S, R), then the continuation of the game from that position onwards can 
be seen as a play of the game EF Wm (S m , R m ). Thus, player I (player II) has a 
winning strategy in the game EF W (S, R) from position (w m , S m , R m ) onwards 
if an only if player I (player II, respectively) has a winning startegy in the 
game EF Wm (S m ,R m ). 

We say that a propositional formula <fi separates string properties S and 
R, in symbols (S, R) \= 0, if s \= <j) for all s G S and r ^ for all r G R. 
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Theorem 3 Suppose (S, R) is a pair of string properties, and let w be a 
positive integer. Then the following conditions are equivalent: 

(l) w Player I has a winning strategy in the game EF W (S, R). 

{2) w There is a propositional formula of size < w such that (S, R) |= 0. 

Proof. We prove the equivalence of (l) w and (2)^, by induction on w. Con- 
sider first the case w = 1 . By Definition [21 there are no moves in the game 
EFi (S,R), and player I wins if and only if there is a proposition symbol pi 
such that either (S,R) \= Pi, or (S,R) \= Since w(pi) = w(-ip») = 1, we 
have (l)i ==>- (2)i. On the other hand, if w(0) < 1, then is, up to logical 
equivalence, either a proposition symbol, or the negation of a proposition 
symbol. Thus, (2)i implies (l)i. 

Let us then consider the case w > 1, and assume (1)„ •<=>■ (2)„ for all 
v < w as an induction hypothesis. To prove (l) w •<==>■ (2) TO , assume first 
that player I has a winning strategy in the game FF W (S, R). We have the 
following two cases according to the type of the first move in the winning 
strategy of player I. 

Case 1: Player I makes a left splitting move by choosing u, v, C and D 
such that 1 < u,v < w, w = u + v and S = C U D. Since this move is given 
by a winning strategy, player I has a winning strategy in both of the games 
EF U (C, R) and EF V (D, R). By induction hypothesis, there are formulas ip 
and 6 such that w(^) < u, w(0) < v, (C, R) \= ip and (D, R) \= 9. Thus, 
s \= ip for all s G C and s \= 6 for all s G D, whence s \= ip V 6 for all s G S. 
On the other hand, for all r G -R, r ^ ip and r ^ 6 1 , whence consequently 
r^V«. It follows that (S, R) (= -0 V 0. As w(^ V 0) = w(V>) + w(0) < 
m + f = w, (2) M is true. 

Case 2: Player I makes a right splitting move by choosing u, v, C and D 
such that 1 < u,v < w, w = u + v and R = C U D. Since this move is given 
by a winning strategy, player I has a winning strategy in both of the games 
EF U (S, C) and EF„(5', D). By induction hypothesis there are formulas ip and 
9 such that w(if)) < u, w(9) < v, (S, C) |= ip and (S, D) |= 6*. Thus, for all 
s G S, s |= -0 and s |= whence s |= ip A 0. On the other hand, r ^ ip for 
all r G C and r # for all r £ D, whence r \/= AO for all r E R. It follows 
that (S, R) (= V A 6». As w(V> A 6») = w(^) + w(6») < u + u = io, (2)^ is true. 

To prove the converse implication (2) w ==>• (l) w , assume that there is a 
formula <fi of size < w such that (5*, i?) |= 0. We show that then player I 
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has a winning strategy in the game EF W (S^R). We assume without loss of 
generality that is in negation normal formj. The strategy depends on as 
follows. 

Case 1: is a proposition symbol or the negation of a proposition symbol. 
Then by the definition of the game EF„,, player I wins without making any 
moves. 

Case 2: is ip V 9. Let C be the set of strings s G S such that s \= ip, and 
let D be the set of strings s G S such that s \= 9. Since s |= for every 
s G S, we have S = C U D. Moreover, since r for every r G -R, we have 
(C, R) |= -0 and (.D, R) \= 9. Finally, as w(0) < w, there are u and t> such 
that w = u + v, w(-0) < it and w(#) < t>. Note that u,v < w, and so, by 
induction hypothesis, player I has a winning strategy in the games EF U (C, R) 
and EF V (D, R). Thus, using w = u + v, S = C U D as his first move and 
these winning strategies in the rest of the game, player I is guaranteed to win 
the game EF„,(S', R). 

Case 3: is ip A 9. Let C be the set of strings r G R such that r ^ ip, and 
let .D be the set of strings r G -R such that s \/= 9. Since r ^ for every 
r G -R, we have R = C U D. Moreover, since s |= for every s G S", we have 
(S, C) |= -0 and (S, D) |= 0. Finally, as w(0) < w, there are u and t> such 
that w — u + v, w(-0) < u and w(#) < t> . Note that u,v < w, and so, by 
induction hypothesis, player I has a winning strategy in the games EF U (S', C) 
and EF^(5', D). Thus, using w = u + v,R = CUD in his first move, and 
these winning strategies in the rest of the game, player I is guaranteed to win 
the game EF„,(S', R). 

Q.E.D. 

Corollary 4 Suppose K is a string property. Then the following conditions 
are equivalent: 

(1) K is definable by a propositional formula of size < w. 

(2) Player I has a winning strategy in the game EF W (S, R) for all S and R 

such that S <ZK and RnK = 0. 

2 A formula is in negation normal form, if all negations occurring in it are in front of 
proposition symbols. A straightforward induction shows that for any formula <fi there is 
an equivalent formula <f>' in negation normal form such that w(<//) = w(4>). 
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Proof. Suppose K is denned by of size < w. Let S and R be given so that 
S C If and -R fl /C = 0. Thus every model in S 1 satisfies and no model in 
R does, whence (5, i?) |= 0. By Theorem [31 player I has a winning strategy 
in EF W (S, R). For the converse, suppose player I has a winning strategy in 
EF W (5', R) for all (S, R) with S C /C and RC\ K. This holds in particular if 
S" is if and is the complement of if. Thus for this choice of S and R we 
have (5*, R) \= for some of size < u;. Thus is defined by 0. Q.E.D. 

We get the following method for showing that a string property K is not 
definable by a propositional formula of size < w. We find classes S and R 
such that 

1. SCif. 

2. RnK = ®. 

3. Player II has a winning strategy in EF lu (S', R). 



3 The Propositional Complexity of Parity 

We shall now prove that the parity of a binary string of n bits cannot be 
expressed with a propositional formula smaller than n 2 . The result has been 
proved in j6] with a method which is very similar to ours, even if it does not 
explicitly use games. We present the proof as an example of the use of our 
game. 

Suppose 5* and R are disjoint sets of binary strings of length n. Let 
E = {{f,g)eSxR:\{i:f i ^g i }\ = l}. 
The density of the pair (S, R) is defined to be the pair 

D(S,R) = (s,r), 

where 

\E\ , \E\ 

S = W andr = M' 

Lemma 5 // s > 1 or r > 1, then there is no proposition symbol pi such 
that (S,R) \= Pi or (S,R) \= 
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Proof. Suppose s > 1 and i G {1, ...,n}. Then there are / G S and 
g,h £ R such that (f,g),(f,h) G Clearly this means that /, = ^ or 
fi = hi, whence / |= pi <^=^ g \= p { or / |= p t <t==^ h \= p t . Thus, if 
all strings in S satisfy pi (->Pi), then there is a string in R satisfying pi (->Pi, 
respectively). It follows that neither p iy nor ->pi separates the sets S and R. 

In the case r > 1 there are f,g G S and h G R such that (/, h), (g, h) G i?. 
Then we have hi = f\ or hi — gi, whence h \= Pi •<=>■ / |= Pi or h \= Pi -<=>• 
<7 |= pi. As above, we conclude that neither p i: nor -ipi separates the sets S 
and R. Q.E.D. 



Lemma 6 (a) Suppose D(S, R) = (s,r) and S = SoUSi, where So fl Si = 0. 
Let D(S , R) = (s ,r ) and D(Si, R) — (si, T\). Then s r + s^ri > sr . 

^ Suppose D(S, R) = (s, r) and i? = i? U where RqH Ri — 0. Let 
D(S, R ) = (s ,r ) and D(S, Ri) — (si,ri). Then s r + siri > sr . 

Proof. We prove (a); the proof of (b) is similar. Clearly ro + r\ = r and 
— + — = r . Since the harmonic mean is never greater than the arithmetic 

so A s o 

mean_|, we have 

s r + s 1 r 1 = s r + sin r + r\ _ r _ 
r ~ r + n -ro + ii~r~ s ' 

«0 Si s 

and the claim follows. Q.E.D. 



Lemma 7 Suppose D(S,R) = (s,r) and w < sr. Then player II has a 
winning strategy in EF W (S, R) . 

Proof. The proof is by induction on w. In the case w = 1, the assumption 
w < sr implies that either s > 1 or r > 1. By Lemma there is no 
proposition symbol Pi such that (S,R) \= Pi or (S,R) \= -ip*. Since the 
game EF 1 (S', i?) ends in its initial position (1,S,R), this means that player 
II automatically wins this game. 

Assume then that w > 1. Using Lemma [5] again, we see that player 
I does not win the game EF W (S,R) without making moves. Suppose then 
that player I makes a left splitting move w = u + v and S = C U D. Let 
So C C and Si C D be sets such that S — So U Si and So fl Si = 0. 

3 We are indebted to Kerkko Luosto for pointing this out. 
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Let D(S ,R) = (s ,r ) and D(Si,R) = (s 1; ri). Then by Lemma E](a), 
■So r o + s i r i > sr - Thus s r o > u or > t>, for otherwise 

sr < s r + S\r\ < u + v = w, 

contrary to the assumption. If soTq > u, then by induction hypothesis, player 
II has a winning strategy in the game EF^So, R), whence, a fortiori, he has 
a winning strategy in EF U (C, R). Otherwise, s\ri > v, and by the same 
argument, player II has a winning strategy in the game EF V (D, R). Thus, in 
any case, player II can make his move in such a way that he is guaranteed 
to win. 

The case of a right splitting move w = u + v and R = C U D is proved in 
the same way by using Lemma [6](b). Q.E.D. 

We are now ready to prove the promised lower bound for parity of binary 
strings. 

Corollary 8 ([5]) If (j) is a propositional formula expressing the parity of 
strings s G {0, l} n , then the size of <fi is at least n 2 . 

Proof. Let S be the set of all strings / G {0, l} n such that \{i : fi = 
1}\ is even, and let R be the complement of S. Thus, \S\ = \R\ = 2 n ~ 1 . 
Furthemore, for each string / G S, there are n different strings g G {0, l} n 
such that \{i : fi gi}\ = 1, and all these strings are in R. Thus, we have 
|j£| _ 2™~ 1 n, and consequently D(S,R) = (n,n). By Lemma [7J player II 
has a winning strategy in the game EF W (S,R) whenever w < n 2 , and we 
conclude that S is not definable by any formula of size less than n 2 . Q.E.D. 

As noted in the introduction, the parity of strings in {0, 1}™ can be ex- 
pressed by a formula of size at most (n + l) 2 . Thus, the lower bound n 2 
cannot be essentially improved. 

4 A Game for Predicate Logic 

We shall next define a game that can be used for measuring the size of a 
first-order sentence needed for expressing properties of models. For the sake 
of simplicity, we will only consider models with relational vocabulary. 

We need to fix some notation first. The universe of a model 21 is denoted 
by A, of 23 by B, etc. We use Xj, j G N, to denote variables. A variable 
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assignment for a model 21 is a finite partial mapping a : N — > A. The finite 
domain of a is denoted by dom(ai). If is a formula, then (21, a) \= means 
that the assignment a satisfies the formula in the model 21. For this to be 
meaningful the domain of the assignment a has to include all the j for which 
the variable Xj is free in 0. 

We shall discuss classes A of structures (21, a), where 21 is a model and a is 
an assignment. We assume that whenever (21, a), (23, f$) G A, then 21 and 23 
have the same vocabulary, and a and /3 have the same domain, which we de- 
note by dom(A). If a is an assignment on 21, a G A and j G N, then a(a/j) is 
the assignment that maps j to a and agrees with a otherwise. If F is a choice 
function on A, that is, F is a function defined on A such that F((2l, a)) G A 
for all (21, a) G A, then -4(F/j) is defined as {(21, a(F((2l, a))/j) : (21, a) G 
.A}. Finally, -4(*/i) = j)) : (St, «) £ A,a e A}. 

Let ^4 and B be classes of structures of a fixed relational vocabulary. 
Assume further that dom(«4) = dom(£>), and is a formula such that j G 
dom(^4) for all variables Xj which are free in 0. As in the case of propositional 
logic, we say that separates the classes A and B, (A, B) \= 0, if (21, a) \= 
for all (21, a) G A and (05, 0) tf= for all (23, p) G B. 

Definition 9 Let A and B be classes of structures of the same relational 
vocabulary with dom(^4.) = dom(£>) 7 and letw be a positive integer. The game 
EF W (A, B) has two players. The number w is called the rank of the game. In 
the beginning the position is (w,A,B). Suppose the position after m moves 
is (w m , A m , B m ) , where dom(^4 m ) = dom(£> m ). There are the following four 
possibilities for the continuation of the game: 

Left splitting move: Player I first chooses numbers u and v such that 1 < 
u,v < w and u + v = w m . Then Player I represents A m as a union 
CUD. Now the game continues from the position (u,C,B m ) or from 
the position (v, V, B m ), and player II can choose which. 

Right splitting move: Player I first chooses numbers u and v such that 
1 < u, v < w and u + v = w m . Then Player I represents B m as a union 
CUT>. Now the game continues from the position (u, A m ,C) or from 
the position {y,Am,T)), and player II can choose which. 

Left supplementing move: Player I chooses a natural number j and a 
choice function F for A m . Then the game continues from the position 
(w m -l,A m {F/j),B m (*/j)). 
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Right supplementing move: Player I chooses a natural number j and a 
choice function F for B m . Then the game continues from the position 
(w m -l,A m (*/j),B m (F/j)). 

The game ends in a position (w m , A m , B m ) and player I wins if there is an 
atomic or a negated atomic formula <p such that (A m , B m ) \= <p. Player II 
wins the game if they reach a position (w m , S m , R m ) such that w m = 1 and 
player I does not win in this position. 

This is a game of perfect information and the concept of winning strategy 
is defined as usual. The game is determined by the Gale-Stewart theorem. 
We define the size w(0) of the formula of predicate logic as follows: 



w(4>) = 1 for atomic <fr 

w(-i0) = w(0) 

w(0V^) = w(0) + w(V') 

w(0A , 0) = w(0) + w(V>) 

w{3xj(p) = w(0) + 1 

W(\/Xj(fi) = w(0) + 1 

Note that there are for each w, up to logical equivalence, only finitely 
many formulas of size < w. 

Theorem 10 Suppose (A, B) is a pair of classes of structures of the same 
vocabulary, and let w be a positive integer. Then the following conditions are 
equivalent: 

(1) w Player I has a winning strategy in the game EF W (A, B). 

(2) „, There is a formula <fi of predicate logic of size < w such that (A, B) \= <fi. 

Proof. We prove the equivalence of {l) w and {2) w by induction on w. Con- 
sider first the case w = 1 . By Definition [HI there are no moves in the game 
EFi(A, B), and player I wins if and only if there is an atomic formula 
such that either (A,B) \= 4> or (A,B) \= Since w(0) = w(->0) = 1, we 
have (l)i =^> (2)i. On the other hand, if w(0) = 1, then cf) is, up to logical 
equivalence, either an atomic formula, or the negation of an atomic formula. 
Thus, (2)x implies (l)i. 
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Let us then consider the case w > 1, and assume (1)„ •<=>- (2) v for all 
v < w as an induction hypothesis. To prove (l) w -<=>- (2) w , assume first 
that player I has a winning strategy in the game EF W (A, B) . We have the 
following four cases according to the type of the first move in the winning 
strategy of player I. 

Case 1: Player I makes a left splitting move by choosing u, v, C and V such 
that 1 < u,v < w, w — u + v and A = C U V. Since this move is given 
by a winning strategy, player I has a winning startegy in both of the games 
EF U (C,B) and EF V (V, B). By induction hypothesis, there are formulas ip 
and 9 such that w(lp) < u, w(0) < v, (C, B) |= ip and (£>, B) |= 0. Thus, 
(21, a) |= V for all (21, a) G C and (21, a) |= for all (21, a) G V, whence 
(21, a) |= ip V for all (21, a) G A On the other hand, for all (23, /3) G B, 
(58, and (05, /3) ^ 9, whence consequently (23, /3) ^ V V 0. It follows 

that (A, B) |= i) V 0. As w(^ V 0) = w(^) + w(0) < it + u = w, (2) w is true. 

Case 2: Player I makes a right splitting move by choosing u, v, C and T> 
such that l<-u,f<w, w = -u + f and B = C U Z>. Since this move is given 
by a winning strategy, player I has a winning startegy in both of the games 
EF U (A,C) and EF V (A,V). By induction hypothesis there are formulas ip 
and 9 such that w(V>) < u, w(0) < v, (A,C) 1= *l> and (A V) \= 9. Thus, for 
all (21, a) G A, (21, a) |= V and (21, a) \= 9, whence (21, a) f= ^ A 0. On the 
other hand, (23, /3) ^ ^ for all (23, (3) E C and (23, 0) ^ for all (23, (3) E V, 
whence (23, (5) ^ V A 9 for all (23, 0) G B. It follows that (A, B) (= -0 A 0. As 
w(?/> A 0) = w(^) + w(0) < -u + f = w, (2) w is true. 

Case 3: Player I makes a left supplementing move by choosing a natural 
number j and a choice function F for A The next position in the game is 
then (w — 1, A(F/j), B(*/j)). Since this move is given by a winning strategy, 
player I has a winning strategy in the game FiF w -i(A(F/j),B(*/j)). By 
induction hypothesis there is a formula ip such that w(^) < w — 1 and 
(A(F/j), B(*/j)) \= if). Let be the formula 3xjip. Then w(0) = w(^) + 1 < 
w, and it suffices to show that (A, B) |= 0. Note first that for all (21, a) E A, 
(%a(a/j)) \= ip, where a = F((2l, a)). Thus we have (21, a) \= <p for all 
(21, a) E A. On the other hand, for all (23, (3) E B and all b E B, we have 
(23,/3(6/j)) whence 

Case 4: Player I makes a right supplementing move by choosing a natural 
number j and a choice function F for B. The next position in the game is 
then (w — 1, A(*/j), B(F/j)). Since this move is given by a winning strategy, 
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player I has a winning strategy in the game EF w _i(A(*/j),B(F/j)). By 
induction hypothesis there is a formula if) such that w(^>) < w — 1 and 
(A(*/j), B(F/j)) |= ip. Let be the formula Wxjip. Then w(0) = w(-0) + 1 < 
w, and it suffices to show that (A, B) |= 0. Note first that for all (21, a) G A 
and all a G A, we have (21, a(a/j)) \= if). Thus we have (21, a) \= for all 
(21, a) G A On the other hand, for all (23, /3) G £>, («B,/3(6/j)) ^ V, where 
6 = F((23, /?)). Thus we have (OS, /3) ^ for all (03, 0) G £. 

To prove the converse implication (2) w ==>- (l) w , assume that there is a 
formula of size < w such that (.4, B) \= 0. We show that then player I has 
a winning strategy in the game FF W (A, B). As in the case of propositional 
logic, we can assume without loss of generality that is in negation normal 
form. The strategy of player I depends on as follows. 

Case 1: is an atomic formula or the negation of an atomic formula. Then 
by Definition|9l player I wins the game EF„,( V 4, B) without making any moves. 

Case 2: is if> V 9. Let C be the class of structures (21, a) G A such 
that (21, a) \= if), and let V be the class of structures (21, a) G A such that 
(21, a) |= 9. Since (21, a) (= for every (21, a) G A, we have A = C U V. 
Moreover, since (23, /3) ^ for every (58, /5) G i3, we have (C, £>) |= -0 and 
(X>, £>) |= 9. Finally, as w(0) < w, there are u and v such that iu = u + v, 
w(ip) < u and w(#) < v. Note that u,v < w, and so, by induction hypothesis, 
player I has a winning strategy in the games FF U (C, B) and EF„(D, B). Thus, 
using w = « + u,^l = CUl , as his first move and these winning strategies in 
the rest of the game, player I is guaranteed to win the game FF W (A, B). 

Case 3: is if) A 9. Let C be the class of structures (*B, /3) G B such 
that (53, /3) y= ift, and let X> be the class of structures (58, /5) G i3 such that 
(23, /3) ^ 0. Since (93, 0) ^ for every (23, /3) G £, we have £ = C U £>. 
Moreover, since (21, a) \= for every (21, a) G ^4, we have (.4, C) |= -0 and 
(^4, V) |= 6*. Finally, as w(0) < w, there are u and w such that w = u + v, 
w(if>) < u and w(#) < v. Note that u,v < w, and so, by induction hypothesis, 
player I has a winning strategy in the games FF U (A, C) and EF„(.4, V). Thus, 
using w = u + v,A = CUVm his first move, and these winning strategies 
in the rest of the game, player I is guaranteed to win the game EF W (A, B). 

Case 4: is 3xjip. Since (21, a) \= for every (21, a) G A, there is a choice 
function F for „4 such that (21, a(F((2l, a))/j)) |= ^ for all (21, a) G A. 
Thus, (21, a*) |= -0 for every (21, a*) G A(F/j). On the other hand, for all 
(23, P) G B we have (23, /3) ^ 0, whence (23,/3(6/j)) ^ ^ for all b E B. 
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In other words, (58, ^ ift for all (58,/?*) G B(*/j). Thus we conclude 
that (A(F/j),B(*/j)) (= V- Note that w(^) = w(0) - 1 < w - 1, and 
so, by induction hypothesis, player I has a winning strategy in the game 
EF w -i(A(F/j), B(~k/j)). Thus, starting with the left supplementing move j 
and F, and using this winning strategy in the rest of the game, player I is 
guaranteed to win the game EF W (A, B). 

Case 5: (ft is Vxjift. Since (58, ($) \t= (ft for every (58, (3) G B, there is a choice 
function F for B such that (58, £(F((58, V= i> for all (58, /?) G B. 

Thus, (58,/?*) ^ V for ever y (®>/3*) e B(F/j). On the other hand, for all 
(21, a) G A we have (21, a) |= (ft, whence (21, a(a/j)) |= ip for all a G A 
In other words, (21, a*) \= ift for all (21, a*) G A(*/j). Thus we conclude 
that (A{*/j),B{F/j)) (= V- Note that w(^) = w(0) - 1 < w - 1, and 
so, by induction hypothesis, player I has a winning strategy in the game 
EF w _ 1 (A(*/j),B(F/j)). Thus, starting with the right supplementing move 
j and F, and using this winning strategy in the rest of the game, player I is 
guaranteed to win the game EF^(^4., B). Q.E.D. 

Corollary 11 Suppose K is a class of models of the same vocabulary. Then 
the following conditions are equivalent: 

(1) /C is the class of models of a first order sentence (ft of size < w. 

(2) Player I has a winning strategy in the game EF W (A, B) for all A and B 

such that A C /C and B fl JC — 0. 

Proof. Suppose fC is the class of models of (ft and w(0) < w. Let A and B 
be given so that A C K and B fl 1C = 0. Thus every model in A satisfies (ft 
and no model in B does, whence (^4, B) \= (ft. By Theorem [TOj player I has 
a winning strategy in FF W {A,B). For the converse, suppose player I has a 
winning strategy in FF W (A, B) for all (^4, B) with A C /C and i3 fl /C. This 
holds in particular if A = K, and B is the complement of /C. Thus for this 
choice of A and £> we have (.A, B) \= (ft for some (ft of size < w. Thus /C is the 
class of all models of (ft. Q.E.D. 

We get the following method for showing that a model class K. is not 
definable by a first order sentence of size < w: We find classes A and B such 
that 

1. AQK. 
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2. sn/c = 0. 



3. Player II has a winning strategy in EF W (A, B). 

A Game for Existential Formulas 

The game EF„, can be used for solving questions of the type 

Given a property of V models, what is the size of the smallest 
first order formula that defines VI 

But in some applications we are interested in definability by some restricted 
type of of formulas, rather than arbitrary first order formulas. For example, if 
a property V of models is known to be definable by an existential formula, it 
is natural to ask, what is the size of the smallest existential formula defining 
V . We will now define a variant of the game EF W that can be used in studying 
this question. 

Here we say that a first order formula is existential if it is in negation 
normal form, and it does not contain any universal quantifiers. In other 
words, existential formulas are built from atomic formulas and negations of 
atomic formulas by using the connectives V, A and the quantifier 3. 

Intuitively, in the definition of the game EF W , left and right splitting 
moves correspond to the connectives V and A, while left and right supple- 
menting moves correspond to the quantifiers 3 and V. Thus, we obtain a game 
for existential formulas simply by dropping right supplementing moves. 

Definition 12 Let A and B be classes of structures of the same relational 
vocabulary with dom(A) = dom(£>) ; and let w be a positive integer. The 
existential game EFl(A,B) has the same rules as EF W (A, B) , except that 
player I is not allowed to make right supplementing moves. 

Theorem 13 Suppose (A, B) is a pair of classes of structures of the same 
vocabulary, and let w be a positive integer. Then the following conditions are 
equivalent: 

(1) w Player I has a winning strategy in the game EF^(^4, B). 

(2) w There is an existential formula <j) of size < w such that (A,B) \= <fi. 
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Proof. The claim is proved by a straightforward adaptation of the proof of 
Lemma [TQJ In the proof of (l) w ==>• (2) w it suffices to note that omitting 
Case 4, the formula <ft separating the classes A and B will always be existen- 
tial. Similarly, in the proof of the opposite implication, Case 5 never occurs 
as the separating formula is existential, whence the winning strategy of 
player I does not use right supplementing moves. Q.E.D. 

As in the case of the full first order game EF^, we get again a method 
for showing that a model class /C is not definable by an existential formula 
of size < w: We find classes A and B such that 

1. A QIC. 

2. £n/C = 0. 

3. Player II has a winning strategy in EF^(^4, B). 

5 The Existential Complexity of Non-Empti- 
ness of Boolean Combinations 

We observed in the introduction that non-emptiness of all Boolean combina- 
tions of n unary predicates can be defined with a sentence of size 8n + 4 if 
a quantifier alternation is allowed. If only existential sentences are allowed, 
this can be expressed with a sentence of size (n + l)2 n . We use now our game 
for existential formulas to show that {n + 1)2™ is the best possible value. 

For each binary string s G {0, 1}™, let *B S be the {Pi, . . . , P n }-structure 
such that for each r G {0, 1}™, the Boolean combination of 
corresponding to r contains exactly two elements, b r and c r , except that the 
combination corresponding to s is empty. Furthermore, let 21 be the structure 
in which each Boolean combination contains exactly one element; let a r be 
the element in the Boolean combination corresponding to r G {0, l} n . Let 
Aq be the class {(21,0)}, and let B be the class {(*B S ,0) : s G {0, l} n }. 
We will show that player II has a winning strategy in the existential game 
EFl(A , B ) for all w < (n + 1)2™. 

Since the game is existential, in any position (u, A, B), the set A contains 
only one structure (21, a), where a is the variable assignment that results 
from the moves of player I. We say that a structure (Q5 S , (3) is flawless (with 
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respect to a), if dom(/3) = dom(a), and for all j G dom(a) and all r G {0, 1}", 
we have 

a(j) = a r = b r . (10) 

Note that if a s G ran(a), then there is no (3 such that (23 s , 0) is flawless. On 
the other hand, if a s g" ran(a), then there is a unique (3 such that (*8 S , 0) is 
flawless; we denote this (3 by (3 s>a . 

Furthermore, we say that a structure (Q5 S , /?) is good enough (with respect 
to a), if it is not flawless, but dom(/3) = dom(a), and (jTOl) holds for all j G 
dom(a) and all r ^ s, and there is a string t G {0, l} n with |{z : Sj ^ ti}\ = 1 
such that 

oc{j) = a s <^ f3(j) = ct (11) 

for all j G dom(a). Thus, if a s G ran (a), then for each t G {0, l} n with 
\{i : Sj ^ ti}\ — 1 there is a unique (3 such that (93 s ,/3) is good enough; we 
denote this (3 by s ,t,a- 

For the rest of this section, B will always denote a set of structures of the 
form (53 s , /3), and ^4 will denote a singleton set {(21, a)}. The measure of the 
set £> is defined to be 

M(B) = (n + l)-f(B)+g(B), 

where f(B) is the number of flawless structures in B and g{B) is the number 
of good enough structures in B. 

Lemma 14 If M(B) > 1, then there is no atomic formula <fi such that 
(A,B) |= <p or (A,B) |= 

Proof. If M(B) > 1, then either there is a flawless structure in B, or there 
are at least two good enough structures in B. If (Q3 S , (3) G 25 is flawless, 
then by condition ( flQj) . (21, a) and (<B S , (3) satisfy the same atomic formulas, 
whence no atomic formula separates A and B. 

Assume then, that (*B S , f3 s ,t,a) an d (25 s '> Ps',t',a) are two distinct good 
enough structures in B. Let be an atomic formula. If <fi is an identity 
%j — %k (with j, k G dom(a)), then it follows easily from fllOp and ( II ip that 
(21, a) |= cf) -<=>- (*B S , /3 Sj t >a ) |= 4>. Thus, 4> does not separate A and £>. 

Consider then the case = Pi(xj), where j G dom(a). Let a(j) = a r . If 
r/s, then s ,t,a(j) = b r , and we have 

(21, a) |= 4> r , = 1 « (».,&,*,«) 1= 0- 
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Similarly, if r 7^ s', then (21, a) |= -<=>- ( OS s /, >t > t0l ) \= 0. Assume finally, 
that r = s = s'. Since (5B S , p 8 ,t,a) i^s', Ps',t',a), we have £ 7^ Moreover, 
since |{i : r; 7^ = \{i : 7^ ^}| = 1, either r\ = ti or r/ = tj. Thus, it is 
not possible that separates (21, a) from both (<B S , (3 s ,t,a) and (5v, s ',t',a)- 
We conclude that in all cases (.A, i3) and (A, i3) ^ -i0. Q.E.D. 

Lemma 15 (a) If B — C U V, then M(C) + Af(D) > M(B). 

f&J J/ A! = A(F/j) and B' = B{*jj), then M(B') > M(B) - 1. 

Proof, (a) If B = C U £>, then obviously /(C) + f(V) > f(B), and c/(C) + 
g{V) >g{B). Hence we have M{C)+M{V) = (n + l){f{C)+f{V)) + {g{C) + 
g{V)) > M(B). 

(b) Let F((2l,a)) = a r . Thus, „4' = {(21, a')}, where a' = a(a r /j). 
Observe first that if /3 SjQ ,) is a flawless structure, and r 7^ s, then 
(5B S , /3 S a {b r /i)) = (25 S ,/3 SQ /) is also flawless, and clearly (*B,/3 SQ ,) G £> -<=>- 
(55,MeB'. 

Assume then that (5B S , /3 S)t)C( ) is a good enough structure. If r 7^ s, then 
(5B S , fis,t,a(p r / j)) = (5$sj Ps,t,a') is also good enough. On the other hand, if 
r = s, then (53 s , (3s,t,a(c t / j)) = (23 s , /3a,t,a0 is good enough. In both cases, 

(»,&,*,«) e6 <^ (!8,MeS' 

Thus, we see that if £> does not contain a flawless structure of the form 
(53 r , p rt a), then /(i3') = /(£>) and g(B') = g(B), whence the claim is true. 

Assume finally, that there is a flawless structure (23 r , /3 r a ) in B. Since 
a'(j) = a r , no structure (JB r ,(3) is flawless with respect to a'. On the other 
hand, for each t G {0,1}™ with \{i : 7^ = 1, there is a new good 
enough structure (<B r , (3 r , a (c t /j)) = (23 r , r ,t,a') in Thus, in this case 
/(£') = f{B) - 1 and g(B') = g(B) + n, whence M{B') = (n + l){f(B) - 
1) + g{B) + n = M(B) - 1. Q.E.D. 

Lemma 16 Ifw< M(B) , then player II has a winning strategy in EF^ (A, B) . 

Proof. We prove the claim by induction on w. Consider first the case 
w — 1. By the definition of the game EF^(A., £>), there are no moves, and 
player I wins only if there is an atomic formula such that (A, B) \= or 
(A, B) |= -i0. Since M{B) > w — 1, by Lemma [HI there is no such 0. 

Assume then that w > 1, and the claim is true for all u < w. Since 
M(B) > w > 1, by Lemma UM again, player I does not win the game without 
making moves. Consider then the options of player I for his first move. 
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Making a left splitting move A = C UV is not possible, since A is a singleton 
{(21, a)}. Suppose then that player I makes a right splitting move w = u + v 
and B = CUV. Then by Lemma Gj^a), M{C) + M{V) > M(B), and since 
w < M(B), either u < M(C), or v < MiV). If u < M(C), then by induction 
hypothesis, player II has a winning strategy in the game EFl(A,C). Similarly, 
if v < M(T>), then player II has a winning strategy in the game EF V (A,T>). 
Thus, by choosing the appropriate position (it, A, C) or (v, A, V), player II is 
guaranteed to win the game EF^(.A, B). 

Suppose then that player I starts with a left supplementing move j and 
F, where F is a choice function for A. The next position in the game is then 
(w - 1, A', B'), where A' = A(F/j) and B' = B(*/j)). By Lemma E^b) and 
our assumption w < M(B), we have w — 1 < M{B) — 1 < M(B'), whence by 
induction hypothesis, player II has a winning startegy in the continuation of 
the game EFl(A, B) from position (w - 1, A', B') onwards. Q.E.D. 

Consider now the classes Aq and Bq defined in the beginning of this 
section. Since the variable assignment in the only structure in Aq is empty, 
all the 2 n structures (Q5 S , 0) in B are flawless. Thus, M(B ) = (n+l)f(B ) + 
g(Bo) = (n + l)2 n , and by Lemma [TB| player II has a winning strategy in the 
game EF^^o, £>o) whenever w < (n + l)2 n . As all Boolean combinations of 
the predicates P\, . . . , P n are non-empty in 21, but each structure in Bq has 
an empty Boolean combination, we get the desired lower bound result: 

Corollary 17 If (p is an existential first order sentence expressing the prop- 
erty that all Boolean combinations of n unary predicates are non-empty, then 
the size of (p is at least {n + l)2 n . Q.E.D. 



6 The Existential Complexity of the Length 
of Linear Order 

As we noted in the introdution, for each n there is first order sentence <p n 
of logarithmic size with respect to n which expresses the property that the 
length of a linear order is at least n. However, the sentence <fi n has an 
unbounded number of quantifier alternations. In this section we show that 
2n — 1 is the minimum size of an existential sentence expressing this property. 

To prove the upper bound, define the following sequence of existential 
formulas: 
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■02 : = ^Xi3x 2 (x 1 < X2), and 

V>n+i : = 3x n+ i(V'n Ai„< x n+ i) for all n > 2. 

Clearly ^„ is true in a linear order if and only if its length is at least n, and 
an easy induction shows that w (■?/>„) — 2n — 1. 

To prove the lower bound, we will use again the existential game EF^. Let 
Ao = {(21, 0)}, where 21 is a linear order of length n, and let B = {(23, 0)}, 
where B is a linear order of length n — 1. Our aim is to show that player II 
has a winning strategy in the game EF^(«4o, £>o) for all w < 2n — 1. 

Consider a position (u, A, B) in the game 

EF^(A),£ ). Since the game is 
existential, A consists of a single structure (21, a). Let a\ < a • • • < a a; be the 
elements in ran(a), and let a and ai +1 be the least and the largest element 
in 21, respectively. We say that a variable assignment ft in 23 is acceptable 
(with respect to a) if dom(/3) = dom(o;), there are elements 61 ■ ■ • <® bi 
such that ran(/3) = {b\, . . . , b{\ and for all i € {1, . . . , /} and all j G dom(o;) 

a(j) = a t ^ f3(j) = b t . (12) 

In other words, (3 is acceptable if and only if the mapping a(j) h-> ft (J), 
j G dom(o;), preserves the relation <. 

Furthermore, we say that ft is nice (with respect to a), if it is acceptable, 
and in addition \{i < I : rf(aj,a i+1 ) ^ d(bi,b i+1 )}\ = 1, where d(x,y) is 
the distance between x and y in the given linear order, and 60 an d 
are the least and the largest element in 03, respectively. The distance from 
defect of ft is defined by 6 (ft) = d(bi,bi + i), where i < I is the unique index 
such that d(ai,a i+ i) 7^ d(bi,b i+ i); we denote this index by i(ft). Note that 
d(bi,b i+1 ) = d(ai,a i+ i) — 1 for i = i(ft). Note also, that for each i < I there 
is exactly one nice ft such that i(ft) = i. 

For the rest of this section, we assume that A is a singleton set {(21, a)} 
and B is a set of structures of the form (53, ft). The niceness measure of i3 is 
defined to be 

JV(B) = £(2*(/3) + l), 

/3eAT 

where A/" is the set of all nice variable assignments /3 such that (23, ft) G B. 

Lemma 18 If N(B) > 1, then there is no atomic formula <j) such that 
(A, B) |= or (A, B) |= -10. 
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Proof. If N(J3) > 1, then either there is a nice assignment G TV such that 
6(0) > 1, or there are two distinct nice assignments 0, 0' G Af. Assume first 
that is a nice assignment in Af, and S(/3) > 1. Then there are elements 
a\, . . . , ai, b\, . . . , bi such that ran(a) = {a\, . . . ,ai\, ran(/3) = {6i,...,6/}, 
ai < a ■ ■ • < a a, and &i ■ ■ ■ < s 6/. Since (3 is nice, = 6(0) > 1 

for i = i((3), and d(bi,b i+ i) = d(ai,a i+ i) > 1 for all other i < I. Thus, we 
have &i < <B • ■ ■ < <B 6/. It follows easily from condition (|12p that (21, a) and 
(53, /3) satisfy the same atomic formulas, whence no atomic formula separates 
A and B. 

Assume then that (3, (3' G A/", and 0^0'. As noted above, this means 
that i(0) 7^ i(0')- Let ai, . . . , aj be as above, and let ran(/3) = {b\, . . . , b{) and 
ran(/3') = . . . , b[} with b x • • • < B 6; and 6; <*...<» &J. As above, 
we see that 6j <® fej+i for all i < I except i = i(0), and similarly b\ < <B b' i+1 
for alii < I except i = i(f3'). Moreover, since i((3) ^ i(0'), for all i,j < I, we 
have either (a$ < a a, < <B 6j) or (a, < a •<=>■ b\ < <B 6^). Using 

condition f lT2|) . it is now easy to see that no atomic formula separates the 
sets A and B. Q.E.D. 

Lemma 19 (a)IfB = CUV, then N(C) + N(V) > N(B). 

(b) If A' = A(F/j) and B' = B(*/j), then N(B') > N(B) - 1. 

Proof, (a) Assume that B = C U T>. Let Af, O and V be the sets of nice 
assignments (3 such that (03, (3) G £>, (03, /3) G C and (03, /3) G T>, respectively 
Then we have 

N(B) = £(2*(/?) + l) 

< 52(25(0) + 1) + J>5(/3) + !) = ^(C) + # 

(b) Let A/" and Af 1 be the sets of nice such that (03, 0) is in i3 and B', 
respectively. Let ran(a) = {ai, . . . , a{\ with ai < a • • • < a a/, where a is 
the assignment such that A = {(21, a)}. As usual, we denote by ao and a^ + i 
the least and the largest element in 21, respectively. Furthermore, we denote 
F((Ql,a)) by c. 

Assume first that c = a(k) for some k G dom(a). It is easy to see that 
for any assignment 0, 

E Af -<=>- 0(d/j) G A/"', 
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where d = 0(k), and moreover 8({3(d/j)) = 5(0). Note also that if ^ 0', 
then 0(d/j) ^ 0'(d'/j). Thus, in this case we have N(B') = N(B). 

Assume next that c = a$ or c = ai + \. Then as above, we see that 
G Af •<=>■ 0(d/j) G A/"', where <i is the least or the largest element in B, 
respectively, and 5(/3(d/j)) = 6(0). Thus, also in this case we conclude that 
N(B') = N(B). 

Assume finally, that c {a , . . . ,a^ +1 }. Then there is an index h < I 
such that ah < >21 c < a^+i- Let be a nice assignment in A/", and let 
ran(/3) = {bi, . . . , b{\ with &i <® • • • <® 6/. As noted earlier, there is exactly 
one nice assignment 0h such that i(0h) — h. If ^ 0h, then d(bh,bh+\) = 
d(dh,ah+i), whence there is an element d such that d(bh,d) = d(a h ,c) and 
d(d, bh+i) = d(c, a h+ i). This means that 6(j3(d/j)) = 6(0), and clearly 0(d/j) 
is in Af'. 

On the other hand, if (3 = (3h, then d(bh, bh+i) = d(au, a^+i) — 1, and there 
are elements d and e such that d(bh, d) = d(au, c) — 1, d(d, bh+i) = d(c, ah+i), 
d(b h ,e) = d(a h ,c) and d(e,b h+1 ) = d(c,a h+1 ) - I. Let (3' = h (d/j) and 
P» = h (e/j). Then (3', (3" e Af', and we have 

5((3') + 6(0") = d(b h , d) + d(e, b h+l ) = d(a h , a h+l ) - 2 = 6((3 h ) - 1, 

whence 

(26(0') + 1) + (26(0") + 1) = 2(6(0 h ) - 1) + 2 = (25(/3 /l ) + 1) - 1. 
Thus, if 0h is in Af, we get 

N(B') = £(25(/3') + l) 

/9'eV' 

= ^ (2<f(/9) + 1) + ((2<f(&) + 1) - 1) = N(B) - 1, 

i 96AA\{ / 9 fe } 

and if h is not in Af, we have jV(B') = N(B). Q.E.D. 



Lemma 20 Ifw < N(B), then player II has a winning strategy in EF^(^4, B) . 

Proof. The proof is verbatim the same as for Lemma [16] just replace M(B) 
by N(B). Q.E.D. 

The proof of the lower bound result is also similar to that in the previous 
section. Consider the sets Aq = {(21,0)} and B = {(03,0)}. Clearly 5(0) = 
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n — 1, and so N(Bq) = 2{n — 1) + 1 = 2n — 1. Thus, by Lemma [2U1 player 
II has a winning strategy in the game EF^(^4 , B ) for all w < 2n — 1. Since 
the length of the linear order 21 is n, while the length of 03 is less than n, we 
have proved 

Corollary 21 7/0 zs an existential first order sentence expressing the prop- 
erty that the length of a linear order is at least n, then the size of is at least 
2ti-1. Q.E.D. 
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